Computers Seeing Action 1 Seeing Action

نویسنده

  • Aaron F. Bobick
چکیده

As research in computer vision has shifted from only processing single, static images to the manipulation of video sequences, the concept of action recognition has become important. Fundamental to understanding action is reasoning about time, in either an implicit or explicit framework. In this paper I describe several spe-ciic examples of incorporating time into representations of action and how those representations are used to recognize actions. The approaches diier on whether variation over time is considered a continuous mapping, a state-based trajectory, or a qualitative, semantically labeled sequence. For two of the domains | whole body actions and hand gestures | I describe the approaches in detail while two others | constrained semantic domains (e.g. watching someone cooking) and labeling dynamic events (e.g. American football) | are brieey mentioned. Understanding video sequences is diierent than conventional image understanding in that one is interested in what is happening in a scene, as opposed to what is in the scene. One might believe that attempting to describe what is happening in hundreds of images is not a viable research of goal given the diiculty of understanding just one picture. However, video understanding can be regarded as a way of providing more constraint in the interpretation of imagery. We require that the image interpretation be plausible over time: extracted structure must obey the temporal constraints of the domain. For example, if we are annotating an American football play, we might be interested in tracking the quarterback. Unfortunately, current (even near future) technology cannot see or track the quarterback in every frame. However, assuming he never disappears from the eld of play, we can \track" him as he enters an amorphous blob and re-emerges six frames later. The program cannot see him during this time, but it knows he's there. Understanding time can be either explicit, as in the above example, or implicit, captured in the representation of action. One example that we will expand upon later is our work in gesture recognition 3, 22]. In this work gesture is represented either deterministically by an explicit sequence of states through which the hand must move, or probabilistically by a hidden Markov model. In both cases the requirement that the interpretation be consistent with the temporal constraints of the domain is guaranteed by matching the input data to learned representations of action which are sensitive to time. From our perspective, one of the future directions …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

“Seeing” the Difference: The Importance of Visibility and Action as a Mark of “Authenticity” in Co-production; Comment on “Collaboration and Co-production of Knowledge in Healthcare: Opportunities and Challenges”

The Rycroft-Malone paper states that co-production relies on ‘authentic’ collaboration as a context for action. Our commentary supports and extends this assertion. We suggest that ‘authentic’ co-production involves processes where participants can ‘see’ the difference that they have made within the project and beyond. We provide examples including: the use of design in health projects which see...

متن کامل

Perception, Mobility and the Affordances of Portable Computers: An Historical Epistemology of Mobile Computing

The interaction between humans and portable technologies, as a part of the broader problem of technology use in Human-Computer Interaction, has received considerable research attention and has been approached from diverse angles. The concept of affordance, drawn from ideas in the psychology of perception, is one of these angles. Unfortunately, most of these theories of perception which premise ...

متن کامل

Comparison of achievement of educational objectives in prosthodontics department of Shahed Dental School according to approved 2001 and 2013 curricula

Background and Objective: The educational process will be useful if the goals for which they are attained are fulfilled during or after the course. The aim of this study was to evaluate the achievement of the educational goals of the Prosthodontics Department of Shahed Dental School in two curricula.   Materials and Methods: This cross-sectional study evaluated two dental curricula using CIPP...

متن کامل

Controlling attention through action: observing actions primes action-related stimulus dimensions.

Previous findings suggest that planning an action "backward-primes" perceptual dimension related to this action: planning a grasp facilitates the processing of visual size information, while planning a reach facilitates the processing of location information. Here we show that dimensional priming of perception through action occurs even in the absence of active action planning. Subjects watched...

متن کامل

طراحی و ساخت دیدسنج نجومی دیفرانسیلی

  Which place is an appropriate site for the construction of the Iranian National Observatory (INO)? In this paper part of the site selection process is reported. The emphasis is on the measuring of the seeing parameter for the pre-selected regions. These regions are examined by meteorological and geophysical studies and finally Kashan, Kerman, Qom and Ferdows sites were selected among 31 regio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996